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Abstract For standard eigenvalue problems, a closed-form expression for the 
condition numbers of a multiple eigenvalue is known. In particular, they are 
uniformly 1 in the Hermitian case, and generally take different values in the 
non-Hermitian case. We consider the generalized eigenvalue problem and iden- 
tify the condition numbers of a multiple eigenvalue. Our main result is that 
a multiple eigenvalue generally has multiple condition numbers, even in the 
Hermitian definite case. The condition numbers are characterized in terms of 
the singular values of the outer product of the corresponding left and right 
eigenvectors. 
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1 Introduction 

This paper is concerned with the sensitivities of a multiple eigenvalue in gen- 
eralized eigenvalue problems Ax = XBx. When perturbations are introduced 
in the matrices, a multiple eigenvalue Ao of multiplicity r generally splits into 
r simple eigenvalues. We are interested in how sensitively each of the r eigen- 
values can be perturbed. 

The first order perturbation expansion for a simple eigenvalue is a well- 
known result [9,10 , and that for a multiple eigenvalue is also studied in [T2] 
for standard eigenvalue problems, and in [1] for generalized eigenvalue prob- 
lems, including singular matrix pairs. Using such results and following the 
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definition of r condition numbers of a multiple eigenvalue for standard eigen- 
value problems introduced in [13j . we naturally define condition numbers for 
the generalized shown below in (j4]). 

Sun shows in [Tj5] that the r condition numbers Ki(A, Ao) for i = 1, . . . , r 
of a nondefective multiple eigenvalue of a non-Hcrmitian matrix are expressed 



where Cj(A, Xo) are the secants of the canonical angles between the left and 
right invariant subspaces corresponding to the multiple eigenvalue. When A 
is non-Hermitian Ci(A, Ao) generally take different values for different i, hence 
so do Ki(A, Xq) and (JTJ) shows that a multiple eigenvalue has multiple con- 
dition numbers. Contrarily, in the Hermitian case we have Cj(A, Ao) = 1, so 
Ki(A, Ao) = 1 for i = 1, . . . , r. Hence §Q also shows the well-known fact that 
a multiple eigenvalue of a Hermitian matrix always has a uniform condition 
number 1. 

Since a standard non-Hermitian eigenvalue problem can be regarded as a 
special case of the generalized non-Hermitian eigenvalue problem Ax — XBx, 
it is clear that a multiple eigenvalue in this case must have multiple condition 
numbers. On the other hand, in the important case of the generalized Hermi- 
tian definite pair (where A and B are Hermitian and B is positive definite), it 
is not trivial whether or not the condition numbers Ki(A, B, Xq) for i = 1, . . . , r 
take different values. In this paper we identify their expression, which shows 
that they generally do take r different values. We shall see that there are two 
sources for this different conditioning, namely the difference between the left 
and right eigenvectors (as present in non-Hermitian standard eigenproblems), 
and the fact that the f?-orthonormal eigenvectors have different 2-norms (as 
present in the case B ^ I). 

It is important to note that in the Hermitian definite natural choice 

of metric can be based on the B-based inner product (x, u)b — % By instead 
of the standard inner product (x, y) = x H y. This leads to the standard eigen- 
value problem for the Hermitian matrix B~ 1 / 2 AB~ 1 / 2 , so in this inner product 
all the condition numbers are the same (see the second remark in section [32]). 
Our discussion in this paper assumes the use of the standard inner product. 

The condition numbers Ki for i = 1, . . . , r of a nondefective finite multiple 
eigenvalue in four situations are summarized in Table [TJ expressed in terms 
of <7i,...,0v, the r positive singular values of X{Y^ (see section HOj) . The 
contribution of this paper is that we fill in the second row, that is, we identify 
the condition numbers of a multiple eigenvalue in a generalized eigenvalue 
problem, both for the Hermitian definite case and the non-Hermitian case. 
Here r is a prescribed positive constant that accounts for perturbation scalings, 
see section HOI 

There are a number of related studies in the literature. [1] investigates the 
Holder condition number, which is essentially K\ in our terminology when Aq is 
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for i — 1 , . . . , r, 
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Table 1 Summary of condition numbers Ki of a multiple eigenvalue for i = 1, . . . , r. 





Hcrmitian 


Non- Hcrmitian 


Ax = Xx 


1 




Ax = XBx 


(1 + t|Ao|) mini<j<j y/(TjO-i-j +1 


(l + rlAol)^!^) 1 ' 



nondefective. The focus of [3] is the effect of the structure of the perturbation 
on the Holder condition number, and in section I4TT1 we discuss how our results 
are related to those in [4]. 

An observation that a multiple generalized eigenvalue has different sensi- 
tivities under perturbations was first made in [TU1 p. 300], which mentions that 

i i • i a T2000 01 

a multiple eigenvalue of a pair such as A - 
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tends to 



behave differently under perturbations in A and B. We note that as shown in 
|14) . for Hermitian definite pairs, small componentwise relative changes in A 
and B can introduce only small relative perturbation to any eigenvalue, and 
it is easy to see the two eigenvalues of the above pair {A, B) have similar per- 
turbation behaviors. However, in terms of "standard" normwise perturbation, 
that is, when (A, B) is perturbed to (A + eE,B + eF) under \\E\\ 2 , \\F\\ 2 < 1 
and e — >■ 0, a multiple eigenvalue can exhibit different behaviors. jTJ[S] consider 
the Hermitian definite case and give an explanation for this behavior, present- 
ing r different perturbation bounds for Ao under perturbations of finite norm. 
The approach of this paper is different in that our focus is on the condition 
numbers, which are attainable perturbation bounds in the first order sense in 
the limit E, F — > 0. The bounds in [7l[5] are valid for non-asymptotic E, F but 
are less tight (generally not attainable) when E, F —> 0. 

Our arguments closely follow that of |13j . in which the condition numbers 
are called worst-case condition numbers, to emphasize the difference from 
the typical-case condition numbers, as presented in [11] , In this sense, our 
results should also be regarded as worst-case condition numbers, in that m are 
the largest attainable bounds in the first order sense. Experiments show that 
these bounds are not likely to be attained in practice for randomly generated 
perturbations, especially for large i (see the example in section ETTj) . 

The structure of this paper is as follows. First in section [5] we establish 
a characterization of condition numbers of a multiple generalized eigenvalue. 
In section [3] we treat the Hcrmitian definite case and show that a multiple 
generalized eigenvalue indeed has multiple condition numbers. Section [4] treats 
the general non-Hermitian case, in which we give simple numerical examples 
to illustrate our results. We also discuss the case of an infinite and defective 
multiple eigenvalue. 

Notations: <Ji(X) denotes the ith largest singular value of a rectangular 
matrix X . Xi(A) denotes the ith eigenvalue of A, arranged in decreasing order 
of magnitude. We use only the matrix spectral norm || • ||2, so that 1 1 ^4 1 1 2 = 



1 



<Ti(A). We use MATLAB notation, in which V(i,j : k) denotes the jth to kth 
elements of the ith row of V. Ik is the identity matrix of order k. 



2 Condition numbers of a multiple generalized eigenvalue 

2.1 Definition 

For an n-by-n matrix pair (A, B), suppose that Ao is a nondefective finite 
multiple eigenvalue (we discuss the infinite and defective cases later in section 
I4.3[) of multiplicity r, so that there exist nonsingular matrices X = {Xi,X-z) 
and Y = (Yi, Y 2 ) with X U Y X € C nxr that satisfy 



Y H AX 



X I r 
J A 



Y H BX 



I r 
J B 



(2) 



Here the spectrum of the pair (Ja,Jb) does not contain Ao- Then, the pen- 
cil (A + eE, B + eF) has eigenvalues Ai, A2, . . . , A r admitting the first order 
expansion [512] 



Ai = X + X i (Y 1 H (E-X F)X 1 )e + o(e), 1 = 1,2, 



(3) 



where X^Y* 1 (E ~ X F)X 1 ) are the eigenvalues of Yf(E - X F)X 1 for i 
I ..... r. In light of §S§ and following the definition presented in [T3], we define 
r condition numbers Ki(A, B, Ao) for i = 1, . . . , r of the multiple eigenvalue Ao 
as follows. 

Definition 1 Let an n-by-n matrix pair (A,B) have the decomposition 
and let r > be a prescribed constant. We define the condition numbers of 
Ao, a multiple eigenvalue of (A, B) of multiplicity r, by 



Kl (A,£,A )= sup \X l (Y 1 H (E-X F)X 1 )\, i = l,. 

||B||2<l,||F|| 2 <r 



(4) 



where the eigenvalues Xi(Y-^ (E — XqF)Xi) are ordered such that |Ai(Y 1 ff (E — 
X F)X 1 )\ > iX^YfiE - X F)Xi)\ >■■■> \X r (Y 1 H (E - X F)X 1 )\. 

In words, Ki{A, -B, Ao) measures by how much small changes in A and B 
can be magnified in the multiple eigenvalue Ao, in the first order sense, r is 
a positive constant that allows for the case where perturbations in A and B 
occur in different magnitudes, which is a notion adopted for example in [3]. 



2.2 Equivalent characterization of Ki(A, B, Ao) 

Here we show that Ki{A, B, Xq) can be expressed in an equivalent form, just as 
in the standard (B = I n ) case Ki(A, Xq) can be expressed as |T]) using the se- 
cants of the canonical angles Cj(A, Xq) between the left and right invariant sub- 
spaces corresponding to Aq. Note that Cj(A, Xq) — dj{X{Y-l I ) for j = 1, . . . ,r. 
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In this paper we use the quantity aj(X\Yi) instead of the canonical angles 
to identify the condition numbers, because it allows us to treat generalized 
eigenvalue problems in a uniform way. 

We use the proof of Theorem 2.1 in [T3], whose crucial identity is 

\XiiYfiE - \ F)Xx)\ = \Xi((E - X F)X 1 Y 1 H )\ = \Xi((E - X F)UEV H )\ 
= \\i{V H {E-\ F)UE)\ 

for i = l,...,r, where X X Y^ = USV H is the "thin" SVD. Here, to get the 
first and last equalities we used the fact [101 P-27] that for general X G C" xm 
and Y € C mx ", the nonzero eigenvalues of XY and those of YX are the 
same. Since V and U have orthonormal columns and E, F can take arbitrary 
matrices with \\E\\ 2 < 1, ||F|| 2 < r, it follows that V H EU 1 V H FU can also take 
arbitrary matrices such that ||y H i?[/|| 2 < 1, || V H FU\\2 < t. Hence, redefining 
E := V H EU and F :— V H FU, we see that the condition numbers Hi{A, B, Ao) 
have the following equivalent characterization. 

Lemma 1 Under the assumptions in Definition [7J suppose that XiY± = 
USV H is the SVD where U — diag(ai) is r-by-r {o~i > <j 2 > . . . > cr r > 0). 
Then, Ki(A, B,Xo) in (T?J) can be expressed as 

Ki(A,B, A ) = sup \Xi(E(E- X Q F))\, i = l,...,r. (5) 

l|S||2<l,||F|| 2 <r 

Here we have a r > because both X\ and Y\ have full column-rank. Note 
that the size of E and F in ([5]) is r-by-r, which is smaller than n-by-rt as in 
(g|). In the sequel we use the expression (JS|) to identify Ki(A,B, Ao). 

3 Hermitian definite pairs 

3.1 Specifications 

When (A, B) is a Hermitian definite pair, all the eigenvalues are always real 
and nondefective, and there exists a nonsingular matrix X such that [5] 

X H BX = /„, (6) 

where Ai is a diagonal matrix containing the eigenvalues not equal to Ao- 
Hence the diagonals of £ in §5§ are the r positive singular values of the matrix 
X\Xi , which are equal to the eigenvalues of the matrix X^X\. Since (A, B) is 
a Hermitian definite pair it is natural to require that the perturbation matrices 
preserve the property, so §5§ becomes the "structured" condition numbers 
Ki(A, B, Ao; S), expressed by 

Ki(A,B,X ;S)= sup \Xi(S(E - X F))\, i = l,...,r. (7) 

[|B|| 2 <l,||i i, ||2<T 
E=E H ,F = F H 



X H AX 



X I r 

A x 
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Denoting D = S 1 / 2 — diag( v / erj", . . . , -^ov), we see that the eigenvalues of 
£{E — XoF) are equal to those of the Hermitian matrix D(E — XoF)D. 

We further observe that E — XqF can represent an arbitrary Hermitian 
matrix H with ||-ff||2 < 1 + t|Ao|, which can be done by letting E = H/\\H\\2 
and F = —tE\\q\/ \q. Conversely, it is easily seen that the class of Hermi- 
tian matrices H with ||i?||2 < 1 + t|Ao| includes all the matrices expressed 
by E — XoF. Together with the fact that the singular values of a Hermitian 
matrix are simply the absolute values of the eigenvalues, we have yet another 
characterization of condition numbers in the Hermitian definite case 

K i {A,B,X ;S) = (l + r|A |) sup a l {DHD), i = l,...,r. (8) 

I|H||2<1 
H = HH 

3.2 Identifying the condition numbers 

Now we are ready to identify the condition numbers Ki(A, B, Ao; S) using the 
expression ([5]). 

Theorem 1 In the Hermitian definite case, Ki(A, B, Xq',S) as in ([7 |) .([8 ]l is 
Ki(A,B, A ;§) = (1 + r|A |) min Jo^Zf~[, i = l,...,r. (9) 

Note that ^JaJoi^J+i is the geometric mean of o~j and Ui-j+i, the jth largest 
and smallest of {a\, . . . , <Tj), which is the set of the i largest singular values 
of XfX x . 

Proof In view of ([8]) , to prove the theorem it suffices to prove that for any 
Hermitian H such that \\H\\2 = 1, ai(DHD) is bounded above by mm, ^jOjTk-j+i 
for i = 1, . . . , r, and that this bound is attainable. 

First, proving attainability is simply done by considering the case where 
H is zero except for its i X i leading principal submatrix, which is set to 
the antidiagonal matrix (which has 1 on the antidiagonals and elsewhere). 
This choice of H makes the i x i leading principal submatrix of DHD also 
an anti-diagonal matrix, whose jth antidiagonal is ^/Uj ai-j+±. Hence we have 
ai(DHD) = min,- y /a ] a l -j +1 . 

Our remaining task is to prove that min, ^WJWi^J+i is an upper bound of 
ai(DHD) for any Hermitian H with ||-ff||2 < 1- Using the max-min charac- 
terization of singular values 8, p. 68], we have 

o-ADHD) = max min \\DHDQv\\ 2 , 
Q H Q=Ii\\v\\ a =l" 

so it suffices to show that for any Q G C rxi with orthonormal columns, there 
exists a unit vector v such that \\DH DQv\\2 < min^ ^aJoiZJ^i. 

To prove this, let jo = argmin J< ^ +1 ) /2^/< J j <J i-j+i ■ Since for any Q we 
have rank(Q(l : i — jo, :)) < i — jo, there are at least jo linearly independent 
vectors in C ixl that are orthogonal to the rows of Q. Therefore there must 
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exist P £ C lX ia w ith orthonormal columns such that the first i — jo rows of 
the r-by-jo matrix QP are all zeros. For such P, we have 

\\DQP\\ 2 = ||diag(0, . . . , 0, V^i-jo+i. ■ • • i \fc)QP\\i < V^-io+i- 

Furthermore, since rank(PPQP(l : jo — 1, :)) < jo — 1, there must exist a 
unit vector w £ O oXl that is orthogonal to HDQP(\ : jo — 1, :), so that the 
first jo — 1 rows of HDQPw are all zeros. We easily see that for such w we 
have \\DHDQPw\\2 < y<Oj <Ji-j 0+ i. Therefore we have shown that for any 
Q e C fcX i with orthonormal columns there exists a unit vector vo = Pw such 
that 

min \\DHDQv\\ 2 < \\DHDQv a \\ 2 < Joj a Oi- Ja+ i = mSjxJaw^j+i- 

|| U || 2 = 1 J 

□ 

Three remarks are in order. 

— When B ^ I ni Ci for i — 1, . . . , r generally take different values, so (J9j) 
shows that a multiple generalized eigenvalue has multiple condition num- 
bers, which is our main result. Note that the ratio among the condition 
numbers is bounded by Ki{A, P, Ao; S)/k t (A, P, Ao; S) < a\ja r . Now since 
o- m in(P _1 ) < o r < cti < ^^(p- 1 ), we have a\ja T < (T max (P)/cr min (P) = 
k 2 (B), the standard 2— norm condition number of P. It follows that if P 
is well-conditioned then a multiple eigenvalue of a Hermitian definite pair 
must have similar condition numbers. 

— For standard Hermitian eigenvalue problems (P = I n ), we have di = 1 
and t = 0, so (O reduces to Ki — 1 for all i, the well-known result that a 
multiple eigenvalue of a Hermitian matrix has a uniform condition number 
1. When one allows for perturbation in P = I n , the condition numbers are 
l+r|Ao| regardless of the multiplicity. The second term suggests that larger 
changes occur in larger eigenvalues. This observation can be summarized as 
follows: for perturbation in A, all the eigenvalues have the same sensitivity 
in the absolute sense, while for perturbation in P, all the eigenvalues have 
the same sensitivity in the relative sense. 

— The above arguments show that the difference among condition numbers of 
a multiple eigenvalue is due to the difference among the r singular values of 
-XiYj , the outer product of the left and right eigenvectors corresponding to 
Ao- <Ji (XiYj ) are all 1 in the standard Hermitian case because X\ = Y\ and 
it is an orthogonal matrix. In the standard non-Hcrmitian case X\ ^ Y\ 
and neither is orthogonal, so Ui{X\Yy) take r different values. In the 
generalized Hermitian case we have X\ = Y\ but X\ is not orthogonal, so 
<Ji{XiXi) again take r different values. Note that one uses the P-based 
inner product this difference disappears because X\ is P-orthogonal, recall 
the remark in the introduction. 
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4 Non-Hermitian pairs 

Here we consider the case where (A, B) is a general non-Hermitian pair. In 
view of ([3]), our task is to bound |A-j(Xi?)| for an arbitrary square matrix 
X such that ||X||2 < 1. This is in fact the exact same problem addressed in 
[T51 Thm.3.1]. Hence the analysis there can be directly applied to yield the 
following result. 

Theorem 2 For a non-Hermitian pair (A, B) that satisfies ([2]), let <j\ > a% > 
. . . > o~ r > be the positive singular values of the matrix X^Y^ 1 . Then, 
Ki(A,B,\o) in §5§ can be expressed as 



4.1 Structured perturbation 

It is instructive to revisit the Hermitian definite case, but now allowing for 
non-Hermitian perturbations, that is, E, F are general matrices whose norms 
are bounded by 1. In this case, the condition numbers Ki(A, B, Ao) have the 
characterization ([5]) (instead of ([5])), so they have the expression (ITUl) . the 
same as that for the non-Hermitian pair. 

As might be expected, the condition number under Hermitian perturbation 
(121) is always no larger than that under a non-Hermitian perturbation (|1U[): 



The above arguments imply that if the singular values of X{Yy are the 
same, then under a general non-Hermitian perturbation the condition numbers 
Ki(A, B, Ao) are all the sam^, regardless of whether (A, B) is Hermitian def- 
inite or non-Hermitian. Therefore, the structure of the perturbation matrices 
plays an important role in the perturbation sensitivity of a multiple general- 
ized eigenvalue. We note that the standard Hermitian case with B = I is an 
exception, in which the condition numbers are always all 1 whether or not the 
perturbation matrices are Hermitian. 

This point of view, to focus on the effect of the structure of the perturba- 
tion, was investigated extensively in [1], in which (Theorem 4.5) it is shown 
that (among other structures they consider) the Hermitian structure of the 

1 In fact the entire first order perturbation expansions become the same. 




(10) 
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perturbation matrices does not have any effect on the Holder condition num- 
ber. 

At first sight this seems to contradict our results, which show that the 
Hermitian structure of the perturbation matrices does affect the condition 
numbers of the multiple eigenvalue Ao- The explanation is that [\ treats only 
the Holder condition number, which is equivalent to Ki(A, B, Ao) in the non- 
defective case. Here we are identifying individual condition numbers of each 
of the r eigenvalues. In fact, we can see that for i = 1, m in and (flO|) are 
the same, both equal to (1 + t|A |)cti. We can easily see that they are equal 
also for i = 2. The difference between © and (|10p starts to take effect only 
for i > 3, so Ao's multiplicity r must be at least 3. In particular, for a simple 
eigenvalue the Hermitian structure of the perturbation has no effect on the 
condition number, which is a trivial consequence of the results in [4]. 



4.2 Examples 

Here we present two simple examples to illustrate the above results and ob- 
servations. 

Example 1 For the Hermitian definite pair A = 

presented in [TU1 p. 300], we have ki(A, B, X ; §) = ki(A, B,\ ) — 3 and 
k,2{A, B, Ao; S) = K2(A, B, X ) — 3/V1000, which explains why the multiple 
eigenvalue Ao = 2 has different sensitivities. Note that in this case the struc- 
ture of the perturbation has no effect on the condition numbers, because the 
multiplicity of Ao is r < 3. 

Example 2 We consider a 4-by-4 Hermitian definite pair (A, B) expressed by 

A = W H AW, B = W H W 1 

where A — diag(1, 1, 1, 2) and W = diag(l, 2, 100, 1), so the eigenvalues of 
(A, B) are 1, 1, 1, 2. Since X that diagonalizes A, B (as in (0) is X = W^ 1 = 
diag(l, 0.5, 0.01, 1) and X\ is its first three columns, the singular values of 
X\Xi are o\ = 1 2 ,<72 = 0.5 2 ,<73 = 0.01 2 (where in this example we let 
r = 1), hence by © it follows that Ki(A, B, 1; S) = 2, k 2 (A, B, 1;S) = 1 and 
K 3 (A,B, 1;S) = 0.02. Using MATLAB version 7.10 we generated 10 6 sets of 
random Hermitian perturbation matrices E and F such that ||.E/||2> \ < 1, 
and examined the behavior of the three eigenvalues of the pair (A+eE, B + eF) 
that are closest to Ao = 1, where we let e = 10 -5 . Specifically, denoting by Ai 
for i = 1,2,3 the three eigenvalues of (A + eE, B + eF) that are closest to 1 
such that |Ai — 1| > | A2 — 1 1 > | A3 — 1|, we examine how large |A; — 1 1 /e can 
be for i = 1, 2, 3. 

We also experimented with non-Hermitian perturbations, in which case 
we let E,F be arbitrary non-Hermitian matrices with 1 1 £7 1 1 2 , \\F H2 < 1. In 
this case the condition numbers (|10p are ki(A, B, 1) = 2, K2(A, B,l) = 2(1 • 
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0.5 2 ) 1 / 2 = land k 3 (A,B,1) = 2(1-0. 52-0.01 2 ) 1 / 3 ~ 0.058, in which we confirm 
that the first two are the same as in the above Hermitian case. 

Lastly, in order to see how the Hermitian property of the matrices plays 
a role in the eigenvalue perturbation behaviors, we also tested with a non- 
Hermitian pair (A, B) that has the same eigenvalues and <x; (of XiY^ 1 ) as 
the above Hermitian pair. We formed such a pair (A, B) by defining A = 
Y- H AX- 1 and B = Y^X- 1 , where A = diag(l, 1, 1, 2), (the first 3 
rows of Y) is set to ZEV H and X\ (the first 3 columns of X) is set to UZ^ 1 , 
where U and V are randomly generated matrices with orthonormal columns, 
£ = diag(eri, o"2, 03) = (l 2 , 0.5 2 , 0.01 2 ) and Z is an arbitrary nonsingular 
matrij|l- Elements of the last row of Y and the last column of X were taken 
as random numbers. Since we have X{Y^ = U£V H , we have Kx(A,B, 1) = 
2, K2(A, B, 1) = 1 and k%{A,B, 1) = 0.058, the same condition numbers as 
the above second case with non- Hermitian perturbation, as was intended. The 
perturbations E and F are taken as arbitrary non- Hermitian matrices. 

In summary we tested under three different situations, all of which have 
the same Oi (XxYf): (i) Both (A, B) and (E, F) are Hermitian (shown as "Her 
+ Her" in Tabled, (ii) (A, B) is Hermitian but (E, F) is non-Hermitian ("Her 
+ NonHcr"), and (iii) Both (A, B) and (E,F) are non-Hermitian ("NonHer 
+ NonHer"). 

The results are summarized in Table [5] below, which shows the average 
and maximum (shown as avg. and max respectively) values of AXi/e — |Aj — 
l|/e among the 10 6 runs with randomly generated E and F, along with the 
condition numbers Ki(A, B, Xq) (which are first order upper bounds for AXi/e) 
for i = 1,2,3. 



Table 2 Average and maximum perturbation A\i/e of 10 6 runs for i = 1, 2, 3. 





Her + Her 


Her 


+ NonHer 


NonHer + NonHer 


i 


avg. 


max 




avg. 


max 




avg. 


max 




1 


0.579 


1.98 


2.0 


0.41 


1.86 


2.0 


0.42 


1.90 


2.0 


2 


0.141 


0.84 


1.0 


0.136 


0.76 


1.0 


0.137 


0.76 


1.0 


3 


0.00018 


0.012 


0.02 


0.00021 


0.027 


0.058 


0.00021 


0.027 


0.058 



We make the following observations. 

— We confirm that /t, is an upper bound of max AXi/e for all i in all three 
cases (which is necessarily true in the limit e — > 0). Interestingly, for i = 1 
the bound Ki is nearly attained while for i = 2,3, max AXi/e is noticeably 
smaller than , which suggests that for larger i it becomes more and more 
rare that the largest-possible perturbation is attained. 

— Reflecting the fact that re, are the same for all the three cases for i = 1 and 
2, we can see that max AXi/e axe similar in all three cases, so two eigen- 
values have similar maximum sensitivities regardless of whether A, B, E, F 



2 Note that the choice of Z does not affect the condition numbers Ki(A, B, 1). 
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are Hermitian or not. On the contrary, maxZ\Ai/e for i = 3 show the 
somewhat different sensitivities of the third eigenvalue depending on the 
structure of E, F. 

— The behavior of the multiple eigenvalue is remarkably similar for the latter 
two cases, not only in terms of maxZ\Ai/e but also avg.AXi/e. This reflects 
the fact that the first order expansions of Ao are the same for the two 
cases, so that the local behavior of an eigenvalue is determined solely by 
the singular values of X\Y-y* , and does not depend on the structure of the 
matrices A and B. 

— Comparing avg.Z\Ai/e with maxZ\Ai/e, we see that the former is much 
smaller than the latter for larger i. For i = 1 the difference seems less 
significant. 

A precise explanation for the last two observations, which necessarily involves 
statistical analysis, is an open problem: our discussions deal only with the 
maximum attainable perturbation maxZ\Ai/e, not with avg.AXi/e. 



4.3 Defective and infinite cases 

So far we have treated only the case where Ao is a finite and nondefective 
multiple eigenvalue. Here we briefly consider the cases where Ao is infinite 
and/or defective. 

The case Ao = oo can be treated as in [TJ[3] simply by considering the 
multiple zero eigenvalue of the pair [B, A), for which the exact same discussion 
as above is valid. 

When Ao is defective, Lidskii's perturbation theory [TJ|3] shows that the 
leading term in Ao's perturbation expansion is not linear in e. Specifically, 
if Ao is an eigenvalue of (A, B) of multiplicity ri\r belonging to a Jordan 
block of dimension n\ repeated r times, then there are nyr eigenvalues of 
(A + eE, B + eF) admitting the expansion 

Xi,i = A + (X i (Y l H (E-X Q F)X 1 )) 1/m e 1 /"! +0 (e 1 /«i ) 

for i = l,2,...,r and t = 1,2,..., nr. Here Yf e C rxn and X 1 e C" xr 
represent the linearly independent left and right eigenvectors of ( A, B) cor- 
responding to Ao, and the value (^Xi(Y^ I (E — AoF)Xi)) takes all the n\ 
distinct nith roots. 

We observe in (flT]) that although the leading exponent is different from 
that in ([3]), the sensitivities of the multiple eigenvalue are still governed by 
\Xi{Yy(E — Ao-F)Xl)| for i = l,...,r, for which we gave a bound in the 
above discussions. Hence all our previous results carry over to this case, and 
the condition numbers of Ao with the exponent l/fii, which we define by the 
theoretical bounds for supi| B || 2<1 ||fI| 2 <t ^(^i^-^ — X F)Xi)\ 1 / ni , are 

l/ni 

/ i \ \ 

Kt l7ll (A, B,X ) 
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By (fTTj) . we must have |A»^ — Aol/e 1 /" 1 < Ki^ ni (A, B, X ) for i = 1, . . . , r in 
the limit e — > for any £7 and F. Note that Ki ini (A, B, Ao) does not depend 
explicitly on £. We observe that in the defective case m > 2, the exponent 
1/rti makes the difference among the condition numbers less significant than 
in the nondefective case. See the example below for an illustration. 



4-3.1 Example 

To examine the behavior of a defective multiple eigenvalue, we generate a 
7-by-7 pair (A, B) defined by 



A = Y 



-H 



J 



X-\ and B = Y- H X- 1 



(12) 



where J 



1 1 

1 



is a 2-by-2 Jordan block. (A, B) has a multiple eigenvalue 



Ao = 1 of multiplicity six and a simple eigenvalue 2. Yf 1 = [Y(:,2) F(:,4) Y(: 
,6)] H = ZEV H and X 1 = [X(:,l) X(:,3) X(:,5)] = UZ- 1 are the left and 
right eigenvectors corresponding to Ao, where U and V are random matrices 
with orthonormal columns and Z is an arbitrary nonsingular matrix. The other 
rows of Y H and columns of X do not affect the condition numbers of Ao, so 
we let them take random values. We let E = diag(l 2 , 0.5 2 , 0.01 2 ), so that 
(Ti(XiYi) take the same values as in the non-Hermitian case of the second 
example in section B~2"1 

Recall from (TTT|) that perturbation in (A, B) generally makes Ao split into 
n\r perturbed eigenvalues Xi t e for i = l,...,r and £ = 1, ...,ni. (jlip also 
shows that for a fixed i, |A^ — A | must be nearly equal for all I up to o(e 1 /™ 1 ). 
For the matrix pair (1121) we have r — 3 and ni = 2, so we separate the six 
eigenvalues into three groups according to the value of i, so that the 
two eigenvalues of the ith group have perturbation sensitivity governed by 
\X i (Y 1 H (E-X F)X 1 )\ 1/ni . 

With r = 1, the condition numbers Ki^,(A,B, 1) for the «th group for i = 

1, 2, 3 are K h2 (A, B, 1) = (2 • I) 1 / 2 = y/2, k 2 , 2 {A, B, 1) - (2 • (1 • Q.h 2 ) 1 ' 2 ) 1 ' 2 = 

1 and K 3<2 (A, B, 1) = (2 • (1 • 0.5 2 • 0.01 2 ) 1 / 3 ) 1/2 ~ 0.24. Comparing these with 
Ki(A, B, 1) in the example in section I4T21 we see that that although a^XiY^ 1 ) 
take the same values, the relative difference among the condition numbers is 
smaller here, due to the exponent 1/2. 

Recalling that we must have — 1 1 / e 1 / 2 < Ki j2 {A, B, 1) for small e, here 
we examine how large |A^ — 1 1 /e 1 / 2 becomes for i = 1,2, 3. To do this, of the 
six eigenvalues of (A + eE, B + eF) close to Ao, we check the perturbation of 
the most perturbed, third perturbed, and the fifth perturbed ones. 
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In the experiment we let E, F be random non-Hermitian matrices with 
H-E-lb, \\F\\2 < 1, let e = 10~ 6 and tested with 10 6 pairs. In table[3]we report 
the average and maximum values of |Aj^ — 1 1 / e 1 / 2 for i = 1, 2, 3. 

Table 3 Defective matrix pair i\ '2] i with three 2x2 Jordan blocks, average and maximum 
perturbation $ i>t - ll/e 1 / 2 of 10 6 runs for i = 1,2, 3. 



i 


avg. 


max 


Ki,2 


1 


0.511 


1.21 


1.41 


2 


0.282 


0.733 


1 


3 


0.0089 


0.138 


0.24 



Similarly to the experiments in section 221 for nondefective multiple eigen- 
values, we see that a defective multiple eigenvalue also exhibits different sen- 
sitivities under perturbation. We also tested with Hermitian perturbations 
E = E H and F = F H , and obtained nearly the same results as in Table [2] 
This suggests that the structure of the perturbation does not play a role here. 

In all our experiments we had |A^i — Ai^l/e 1 ^ 2 < 0.04 for i = 1, 2, 3, which 
matches the theoretical result indicated by (jTTJ) that for a given i, \Xi.e — 1| 
are equal up to o(e 1 /" 1 ) for all £. 

Finally, a comparison between Tableland the third case of Table[2]suggests 
that the relative difference among the multiple eigenvalues is smaller in the 
defective case, reflecting the last remark before this example. 

Acknowledgements I thank the referees for their helpful suggestions, which improved the 
paper considerably. 
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